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This paper is devoted to the performance study of the Linear Minimum Mean Squared Error estimator 
for multidimensional signals in the large dimension regime. Such an estimator is frequently encountered 
in wireless communications and in array processing, and the Signal to Interference and Noise Ratio 
(SINR) at its output is a popular performance index. The SINR can be modeled as a random quadratic 
form which can be studied with the help of large random matrix theory, if one assumes that the dimension 
qq ■ of the received and transmitted signals go to infinity at the same pace. This paper considers the asymptotic 
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behavior of the SINR for a wide class of multidimensional signal models that includes general multi- 
antenna as well as spread spectrum transmission models. 

The expression of the deterministic approximation of the SINR in the large dimension regime is recalled 
and the SINR fluctuations around this deterministic approximation are studied. These fluctuations are 
shown to converge in distribution to the Gaussian law in the large dimension regime, and their variance 
is shown to decrease as the inverse of the signal dimension. 

Index Terms 

Antenna Arrays, CDMA, Central Limit Theorem, LMMSE, Martingales, MC-CDMA, MIMO, Ran- 
dom Matrix Theory. 



(1) Telecom ParisTech (ENST), Paris, France, abla . kammoun@enst . f r 

(2) Telecom ParisTech (ENST) and Casablanca University, Morocco, malika . kharoufSenst . fr 

(3) CNRS / ENST, Paris, France, walid.hachem, jamal .najim@enst.fr 



October 10, 2008 



DRAFT 



A CENTRAL LIMIT THEOREM FOR THE SINR AT THE LMMSE ESTIMATOR OUTPUT 



2 



I. Introduction 



Large Random Matrix Theory (LRMT) is a powerful mathematical tool used to study the performance 
of multi-user and multi-access communication systems such as Multiple Input Multiple Output (MIMO) 
digital wireless systems, antenna arrays for source detection and localization, spread spectrum commu- 
nication systems as Code Division Multiple Access (CDMA) and Multi-Carrier CDMA (MC-CDMA) 
systems. In most of these communication systems, the N dimensional received random vector r G 
is described by the model 



where s = [so,si, . . . ,sk] T is the unknown random vector of transmitted symbols with size K + 1 
satisfying Ess* = Ir+i, the noise n is an independent Additive White Gaussian Noise (AWGN) with 
covariance matrix Enn* = pljv whose variance p > is known, and matrix ~E represents the known 
"channel" in the wide sense whose structure depends on the particular system under study. One typical 
problem addressed by LRMT concerns the estimation performance by the receiver of a given transmitted 
symbol, say so. 

In this paper we focus on one of the most popular estimators, namely the linear Wiener estimator, also 
called LMMSE for Linear Minimum Mean Squared Error estimator: the LMMSE estimate so = g*r of 
signal so is the one for which the N x 1 vector g minimizes E|so — so| 2 . If we partition the channel 
matrix as ~E = [y Y] where y is the first column of £ and where matrix Y has dimensions N x K, 
then it is well known that vector g is given by g = (SS* + pLv) -1 y. Usually, the performance of this 
estimator is evaluated in terms of the Signal to Interference plus Noise Ratio (SINR) at its output. Writing 
the received vector r as r = soy + r; n where soy is the relevant term and r; n represents the so-called 
interference plus noise term, the SINR is given by (3k = |g*y| 2 /E|g*ri n | 2 . Plugging the expression of g 
given above into this expression, one can prove that the SINR (3k is given by the well-known expression: 



In general, this expression does not provide a clear insight on the impact of the channel model parameters 
(such as the load factor KN^ 1 , the power distribution of the transmission data streams, or the correlation 
structure of the channel paths in the context of multi-antenna transmissions) on the performance of the 
LMMSE estimator. 

An alternative approach, justified by the fluctuating nature of the channel paths in the context of 
MIMO communications and by the pseudo-random nature of the spreading sequences in spread spectrum 



r = Ss + n 



(1) 



(3 K = y* (YY* + pIn)' 1 y . 



(2) 
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applications consists to model matrix 5] as a random matrix (in this case, (3k becomes a random 
SINR). The simplest random matrix model for X, corresponding to the most canonical MIMO or CDMA 
transmission channels, corresponds to independent and identically distributed (i.i.d.) entries with mean 
zero and variance iV -1 . In that case, LRMT shows that when K — > oo and the load factor KN^ 1 
converges to a limiting load factor a > 0, the SINR 0k converges almost surely (a.s.) to an explicit 
deterministic quantity 0(a, p) which simply depends on the limiting load factor a and on the noise 
variance p. As a result, the impact of these two parameters on the LMMSE performance can be easily 
evaluated [1], [2]. 

The LMMSE SINR large dimensional behavior for more sophisticated random matrix models has also 
been thoroughly studied (cf. [1], [3]-[9]) and it has been proved that there exists a deterministic sequence 
(f3 K ), generally defined as the solution of an implicit equation, such that 0k — k ~~ * almost surely 
as K — > oo and |£ remains bounded away from zero and from infinity. 

Beyond the convergence (3k ~ k — ► 0, a natural question arises concerning the accuracy of (3 K for 
finite values of K. A first answer to this question consists in evaluating the Mean Squared Error (MSE) 
of the SINR K\(3k — (3k\ 2 f° r large K. A further problem is the computation of outage probability, that is 
the probability for (3k — (3k to be below a certain level. Both problems can be addressed by establishing 
a Central Limit Theorem (CLT) for (3k — (3k- m this paper, we establish such a CLT (Theorem [3] below) 
for a large class of random matrices X!. We prove that there exists a sequence @ 2 K = 0(1) such that 
j^-{Pk — k) converges in distribution to the standard normal law JV(0, 1) in the asymptotic regime. 

Q2 

One can therefore infer that the MSE asymptotically behaves like and that the outage probability can 
be simply approximated by a Gaussian tail function. 

The class of random matrices S we consider in this paper is described by the following statistical 
model: Assume that 



where the complex random variables W n k are i.i. d. with EW nk = 0, Elf n \ = and E|W nfc | 2 = 1 and 
where {a 2 nk \ l<n<N; < k < K) is an array of real numbers. Due to the fact that E|X n /%| — -fr-, 




(3) 



the array (a^k) ^ s re f erre d to as a variance profile. An important particular case is when a\ 



is separable, 



that is, writes: 



(4) 



where (d\, . . . , cZjv) and (do, . . . , d^) are two vectors of real positive numbers. 
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Applicative contexts. 

Among the applicative contexts where the channel is described appropriately by model © or by its 
particular case (01), let us mention: 

• Multiple antenna transmissions with K + 1 distant sources sending their signals toward an array of 
N antennas. The corresponding transmission model is r = Hs + n where S = ^HP 1 / 2 , matrix 
H is a N x (if + 1) random matrix with complex Gaussian elements representing the radio channel, 
P = diag(poj-- - ,Pk) is the (deterministic) matrix of the powers given to the different sources, 
and n is the usual AWGN satisfying Enn* = pljy. Write H = [ho • • • h/<], and assume that the 
columns are independent, which is realistic when the sources are distant one from another. Let 

be the covariance matrix = Ehfch*, and let = U^A^U/,. be a spectral decomposition of 
where A& = diag(A n/ t; 1 < n < N) is the matrix of eigenvalues. Assume now that the eigenvector 
matrices Uo, • • • , are all equal (to some matrix U, for instance), a case considered in e.g. [10] 
(note that sometimes they are all identified with the Fourier N x N matrix [11]). Let X! = U*H. 
Then matrix X is described by the statistical model (0 where the W n k are standard Gaussian i.i.d., 
and cr 2 fc = X n kPk- If we partition S as 3 = [x X] similarly to the partition E = [y Y] above, 
then the SINR (3 at the output of the LMMSE estimator for the first element of vector s in the 
transmission model r = Ss + n is 

= x* (XX* + pIn)- 1 x = y* (YY* + pI N y l y 

due to the fact that U is a unitary matrix. Therefore, the problem of LMMSE SINR convergence 
for this MIMO model is a particular case of the general problem of convergence of the right-hand 
member of (f2]) for model (f3]). 

It is also worth to say a few words about the particular case (0]) in this context. If we assume that 
Ao = • • • = Ak and these matrices are equal to A = diag(Ai, . . . , Ajv), then the model for H is 
the well-known Kronecker model with correlations at reception [12]. In this case, 

S = ITH = — LlTHP 1 / 2 = — La^WP 1 / 2 (5) 

y/K sfK 

where W is a random matrix with iid standard Gaussian elements. This model coincides with the 
separable variance profile model dU with d n = X n and dk = Pk- 

• CDMA transmissions on flat fading channels. Here is the spreading factor, K + 1 is the number 
of users, and 

S = VP 1 / 2 (6) 
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where V is the N x (K + 1) signature matrix assumed here to have random i.i.d. elements with 
mean zero and variance N^ 1 , and where P = diag(po) • • • ,Pk) is the users powers matrix. In this 
case, the variance profile is separable with d n = 1 and = ^Pk- Note that elements of V are not 
Gaussian in general. 

• Cellular MC-CDMA transmissions on frequency selective channels. In the uplink direction, the 
matrix E is written as: 

E = [H v • • • H K+1 v K+1 ] , (7) 

where = diag(/ifc(exp(2z7r(n — 1)/N); 1 < n < N) is the radio channel matrix of user k (i = 
V— 1) in the discrete Fourier domain (here N is the number of frequency bins) and V = [vq, • • • , Vr] 
is the N x (K + 1) signature matrix with i.i.d. elements as in the CDMA case above. Modeling this 
time the channel transfer functions as deterministic functions, we have a^ k = -^|/ifc(exp(2z7r(n — 
1)/A0)| 2 . 

In the downlink direction, we have 

£ = HVP 1 / 2 (8) 

where H = diag(/i(exp(2z7r(n — l)/N); 1 < n < N) is the radio channel matrix in the discrete 
Fourier domain, the N x (K + 1) signature matrix V is as above, and P = diag(p , • • • ,Pk) is the 
matrix of the powers given to the different users. Model ([8]) coincides with the separable variance 
profile model (0]) with d n = ^|/i(exp(2z7r(n — l)/iV))| 2 and dk = Pk- 

About the literature. 

The asymptotic approximation (3 K (first order result) is connected with the asymptotic eigenvalue 
distribution of Gram matrices YY* where elements of Y are described by the model ([3]), and can be 
found in the mathematical LRMT literature in the work of Girko [13] (see also [14] and [15]). Applications 
in the field of wireless communications can be found in e.g. [6] in the separable case and in [8] in the 
general variance profile case. 

Concerning the CLT for j3x — @k (second order result), only some particular cases of the general 
model have been considered in the literature among which the i.i.d. case (<r 2 fe = 1) is studied in [16] 
(and based on a result of [17] pertaining to the asymptotic behavior of the eigenvectors of YY*). The 
more general CDMA model © has been considered in [18], using a result of [19]. The model used in 
this paper includes the models of [16] and [18] as particular cases. 
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Fluctuations of other performance indexes such as Shannon's mutual information E log det — h IjvJ 
have also been studied at length. Let us cite [20] where the CLT is established in the separable case and 
[21] for a CLT in the general variance profile case. Similar results concerning the mutual information 
are found in [22] and in [23]. 

Limiting expressions vs K -dependent expressions. 

As one may check in Theorems [2] and [3] below, we deliberately chose to provide deterministic 
expressions K and Q 2 K which remain bounded but do not necessarily converge as K — > oo. For instance, 
Theorem |2] only states that 0k — 0k ~^ almost surely. No conditions which would guarantee the 
convergence of 0k are added. This approach has two advantages: 1) such expressions for K and @ K 
exist for very general variance profiles (<r^L) while limiting expressions may not, and 2) they provide a 
natural discretization which can easily be implemented. 

The statements about these deterministic approximations are valid within the following asymptotic 
regime: 

K K 
K — > oo, liminf — > and lira sup — < oo . (9) 

N N 

Note that ^ is not required to converge. In the remainder of the paper, the notation "K — ► oo" will refer 
to ©. 

We note that in the particular case where £ — > a > and the variance profile is obtained by a regular 
sampling of a continuous function / i.e. a 2 nk = f k^i\ it is possible to prove that K and @ K 
converge towards limits that can be characterized by integral equations. 

Principle of the approach. 

The approach used here is simple and powerful. It is based on the approximation of 0k by the sum 
of a martingale difference sequence and on the use of the CLT for martingales [24]. We note that apart 
from the LRMT context, such a technique has been used recently in [25] to establish a CLT on general 
quadratic forms of the type z*Az where A is a deterministic matrix and z is a random vector with i.i.d. 
elements. 

Paper organization. 

In Section |II1 first-order results, whose presentation and understanding is compulsory to state the CLT, 
are recalled. The CLT, which is the main contribution of this paper, is provided in Section |III] In Section 
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ITVl simulations and numerical illustrations are provided. The proof of the main theorem (Theorem [3]) in 
given in Section |V] while the Appendix gathers proofs of intermediate results. 



Notations. 

liV 



Given a complex N x N matrix X = [xij\^- =1 , denote by ||X|| its spectral norm, and by IXI^ its 
iximum row sum norm, i.e., HXf!^ = maxi 
a vector and by || • its max (or £oo) norm. 



maximum row sum norm, i.e., HXf^ = maxi<j<Ar Y^j=i \ x ij\- Denote by || • || the Euclidean norm of 



II. First Order Results: The SINR Deterministic Approximation 

In the sequel, we shall often show explicitly the dependence on K in the notations. Consider the 
quadratic form (J2J): 

P K = y* (YY* + P I N y l y , 
where the sequence of matrices S(i^) = [y(K) Y(K)] is given by 

£(*) = = (^fW N ' K 



n=l,fc=0 

Let us state the main assumptions: 

Al: The complex random variables (W nk ; n > 1, k > 0) are i.i.d. with EWi = 0, EWfo = 0, 
E|Vy 10 | 2 = 1 and E|l^i | 8 < cxd. 

A2: There exists a real number <7 max < cxd such that 

sup max \a nk {K)\ < cr max . 
K>1 ^ n ^ N 

— 0<k<K 

Let (a m ; 1 < m < M) be complex numbers, then diag(a m ; 1 < m < M) refers to the M x M 
diagonal matrix whose diagonal elements are the a m 's. If A = (a^) is a square matrix, then diag(A) 
refers to the matrix diag(ojj). Consider the following diagonal matrices based on the variance profile 
along the columns and the rows of S: 

T> k (K) = di ag (a 2 lk (K),... ,a% k (K)), < k < K 
B n (K) = di ag (a 2 nl (K) r .. ,a 2 nK (K)), \<n<N. 
A3: The variance profile satisfies 

liminf min — txT) k {K) > . 

K>1 o<k<K K v ; 

Since E|^i | 2 = 1, one has IE|Wi | 4 > 1. The following is needed: 
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A4: At least one of the following conditions is satisfied: 

E| W W \ A > 1 or liminf tr \ T> (K) ^ D fc (2f) J > . 

Remark 1: If needed, one can attenuate the assumption on the eighth moment in Al. For instance, 
one can adapt without difficulty the proofs in this paper to the case where E| Wio | 4+e < oo for e > 0. 
We assumed ElVFio] 8 < °° because at some places we rely on results of [21] which are stated with the 
assumption on the eighth moment. 

Assumption A3 is technical. It has already appeared in [26]. 

Assumption A4 is necessary to get a non- vanishing variance Q 2 K in Theorem [3] 



The following definitions will be of help in the sequel. A complex function t(z) belongs to class S if 
t(z) is analytical in the upper half plane C + = {z G C ; im(z) > 0}, if t(z) G C + for all z G C + and 
if im(z)|i(z)| is bounded over the upper half plane C+. 

Denote by Qk(z) and Qk(z) the resolvents of Y(K)Y(K)* and Y(K)*Y(K) respectively, that is 
the N x N and K x K matrices defined by: 

Q K (z) = (Y(K)Y(K)* - zIn)- 1 and Q K (z) = (Y(K)*Y{K) - zl K y l . 

A. The SINR Deterministic approximation 

It is known [13], [26] that there exists a deterministic diagonal N x N matrix function T(z) that 
approximates the resolvent Q(z) in the following sense: Given a test matrix S with bounded spectral 
norm, the quantity itr S(Q(z) — T(z)) converges a.s. to zero as K —> oo. It is also known that the 
approximation (3 K of the SINR /3k is simply related to T(z) (cf. Theorem [2]). As we shall see, matrix 
T(z) also plays a fundamental role in the second order result (Theorem [3). 

In the following theorem, we recall the definition and some of the main properties of T(z). 

Theorem 1: The following hold true: 

1) [26, Theorem 2.4] Let {a 2 nk {K); 1 < n < N; 1 < k < K) be a sequence of arrays of real 
numbers and consider the matrices Dfc(if) and Y) n (K) defined in (fTOl ). The system of N + K 
functional equations 



t n ,K(z) = — v-, 1 < n < N 

z[l + ^tr[D k (K)T K (z))) 



(11) 
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where 

T K (z) = diag(t ltK (z), ■ ■■ , t N)K (z)), T K (z) = diag(t ltK (z), ... , t K>K (z)) 

admits a unique solution (T, T) among the diagonal matrices for which the t n ,.ftr's and the fair's 
belong to class S. Moreover, functions t n x(z) and k(z) admit an analytical continuation over 
C — R + which is real and positive for z G (—00, 0). 
2) [26, Theorem 2.5] Assume that Assumptions Al and A2 hold true. Consider the sequence of 
random matrices Y(K)Y(K)* where Y has dimensions N x K and whose entries are given by 
Y n k = ^j=W n k- For every sequence S^- of N x N diagonal matrices and every sequence Sa' of 
K x K diagonal matrices with 

supmax ( ||Sjf||, \\Sk\\ ) < 00 , 
k ^ ' 

the following limits hold true almost surely: 

lim — tr S K (Qk(z) - T K (z)) = 0, VzGC-M + , 
lim — tr Sk (Qk{z) - T K (z)) = 0, \/z G C - R + . 

K— >oo J\ V / 



The following lemma which reproduces [27, Lemma 2.7] will be used throughout the paper. It 
characterizes the asymptotic behavior of an important class of quadratic forms: 

Lemma 1: Let x = [X\, . . . , Xn] t be a N x 1 vector where the X n are centered i.i.d. complex random 
variables with unit variance. Let A be a deterministic N x N complex matrix. Then, for any p > 2, there 
exists a constant C p depending on p only such that 

*N\P/2 



E 



^x* Ax - ltr(A; 



P < 9l. ((E|X 1 | 4 tr(AA*)) p/ " + E|X 1 | 2p tr ((AA*f/ 2 



(12) 



Noticing that tr(AA*) < N\\ A|| 2 and that tr ((AA*) p / 2 ) < N\\ A|| p , we obtain the simpler inequality 

Cm 



E 



^x* Ax - ltr(A) 



<^||Af ((ElXxl^ + El^l* 



(13) 



which is useful in case one has bounds on ||A||. 

Using Theorem [T]and Lemma [Q we are in position to characterize the asymptotic behavior of the quadratic 
form 0k given by ©. We begin by rewriting (3k as 

(3 K = ^w^Dj /2 (YY* + pl^)- 1 D^ /2 w = lw^ /2 Q(-p)Dj /2 wo (14) 
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where the N x 1 vector wo is given by wo = [Wio, • • • , Wato] T an d the diagonal matrix Do is given by 
(flOl) . Recall that wo and Q are independent and that ||Do|| < ff^ax by A2. Furthermore, one can easily 
notice that ||Q(-p)|| = ||(YY* + pi)- 1 ]] < 1/p. 

Denote by Eq the conditional expectation with respect to Q, i.e. Eq = E( • ||Q). From Inequality 
(fT3l) . there exists a constant C > for which 



EE 



Q 



p K - -trDoQ(-p) 



< 



c 






(r 


C 



















2 / a 2 . 



((E|W 10 | 4 ) 2 + E|^ 10 | 8 ) 



By the Borel-Cantelli Lemma, we therefore have 

1 



tr(D Q(-p)) ► 

A K— >oo 



a.s. 



Using this result, simply apply Theorem [T]-© with S = Do (recall that ||Do|| < c^max) to obtain: 

Theorem 2: Let j3 K = —tT(Do(K)TK(—p)) where Tk is given by Theorem [l]-©. Assume Al and 
K 

A2. Then 



Pk-P 



K 



K^oo 



a.s. 



B. The deterministic approximation in the separable case 

In the separable case a n k(K) = d n (K)dk(K), matrices Dfc(if) and D n (K) are written as Dfc(if) = 
d k (K)B(K) and D n {K) = d n (K)B(K) where D(iT) and D(A') are the diagonal matrices 

D(A') = diag(di(A'),... ) d iV (A')) ; D(a') = diag(di (A'), . . . , cLk(K)) • (15) 

and one can check that the system of N + K equations leading to TV and Tk simplifies into a system 
of two equations, and Theorem Q] takes the following form: 
Proposition 1: [26, Sec. 3.2] 

1) Assume cr^ k (K) = d n {K)dk{K). Given p > 0, the system of two equations 

6k(j>) = ±tr(T>(p(I N + 5 K (p)T>)^ 1 

(16) 

S K (p) = Ut(b(p(I k + 6 k (p)-D) 
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where D and D are given by (fT51) admits a unique solution (Sk(p), 5k(p))- Moreover, in this case 
matrices T(— p) and T(— p) provided by Theorem [D-CO) coincide with 

T(-p) = -(I + ^(p)D)- 1 and T(-p) = -(I + d^ft)- 1 . (17) 
P P 

2) Assume that Al and A2 hold true. Let matrices Sk and Sk be as in Theorem [l]-©. Then, 
almost surely £tr (S K (Q K (-p) - T K (-p))) and £tr (s K (Qk(~p) - T K (-p))) 
as K — > co. 



With these equations we can adapt the result of Theorem[2]to the separable case. Notice that Do = c?oD 
and that 5(p) given by the system (fT6T > coincides with ^tr(DT), hence 

Proposition 2: Assume that o\ k {K) = d n {K)dk{K), and that Al and A2 hold true. Then 

-=- — $k(p) ► a.s. 

do K^oo 

where 5k(p) is given by Proposition [T]— ([TJ. 



Let us provide a more explicit expression of 5k which will be used in Section JV] to illustrate the 
SINR behavior for the MIMO Model © and for MC-CDMA downlink Model (HJ). By combining the 
two equations in System (fT6l ), it turns out that 5 = 5k(p) is the unique solution of the implicit equation 

n=0 P+ K Qn 2^k=l l+ Pk 5 

Recall that in the case of the MIMO model ([5]), d n = X n and dk = Pk, while in the case of the MC- 
CDMA downlink model ([8]), d n = |£|/t(exp(2z7r(n — 1)/A^)| 2 and dk = Pk again. Here do = po is the 
power of the user of interest (user 0), and therefore (3k /do is the normalized SINR of this user. Notice 
that Sk{p) is almost the same for all users, hence the normalized SINRs for all users are close to each 
other for large K. Their common deterministic approximation is given by (TT8T ) which is the discrete 
analogue of the integral equation (16) in [6]. 
This example will be continued in Section |III] 

III. Second order results: The Central Limit Theorem 
The following theorem is the main result of this paper. Its proof is postponed to Section [V] 
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Theorem 3: 1) Assume that A2, A3 and A4 hold true. Let Ak and Ak be the K x K matrices 

K 



1 ^trD £ D m T(-p) 2 
*(l + £trD/T(-p)) : 



1 



1 + -trD,T(-p) 




(19) 



A K = diag 

where T is defined in Theorem [T|-([T]). Let gx be the K x 1 vector 
Sk 



-^trDoDiTl-p) 2 , • • • , ^trV Q V K T{-p) 2 



Then the sequence of real numbers 



P> 2 



A^A^g + (E|^ 10 | 4 - l)-trD 2 T(-p) 2 



(20) 



is well defined and furthermore 



< lim inf Q 2 K < lim sup Q 2 K < oo . 

K K 

2) Assume in addition Al. Then the sequence (5k = y*(YY* + pl)~ 1 y satisfies 

\f~K 



{Pk-Pk) 



Af(0,l) 



in distribution where j3 K = -^trDoTV is defined in the statement of Theorem |2j 
Remark 2: (Comparison with other performance indexes) It is interesting to compare the "Mean 
Squared Error" (MSE) related to the SINR f3 K : MSE([3 K ) = E(f3 K - P K ) 2 , with the MSE related 
to Shannon's mutual information per transmit dimension / = -4 log det(p5]S* +1) (studied in [21], [22] 
for instance): 



MSE(p K ) 



oc 



while MSE(I) 



oc 



Remark 3: (On the achievability of the minimum of the variance) Recall that the variance writes 



P> 2 



1 



+ (E|VT 10 | 4 -l)^trD 2 l T 2 



1 

K 



As E| Wio| 2 = 1, one clearly has E| Wiol 4 — 1 > with equality if and only if |Wio| = 1 with probability 
one. Moreover, we shall prove in the sequel (Section IV-BI ) that liminf^ ^T)q(K)T k > 0. Therefore 
(E|Wio| 4 — l)"^trDQT 2 is nonnegative, and is zero if and only if |Wio| = 1 with probability one. As 
a consequence, Q 2 K is minimum with respect to the distribution of the W n k if and only if these random 
variables have their values on the unit circle. In the context of CDMA and MC-CDMA, this is the case 
when the signature matrix elements are elements of a PSK constellation. In multi-antenna systems, the 
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W n kS are frequently considered as Gaussian which induces a penalty on the SINR asymptotic MSE with 
respect to the unit norm case. 



In the separable case, Q\- = d^fl 2 K where £l K is given by the following corollary. 

'nk 



Corollary 1: Assume that A2 is satisfied and that a 2 k = d n dk- Assume moreover that 



minf liminf — tr(D(K)), liminf — tr(D(K)) ) > (21) 
\ K K K K J 



where D and D are given by (fT3Tl. Let 7 = -^trD 2 T 2 and 7 = -^trD 2 T 2 . Then the sequence 



n\=l{r/X^ + (nW W \*-l)\ (22) 
Vl-/) 2 77 ) 

satisfies < liminf^- Q 2 K < limsup^ £l 2 K < 00. If, in addition, Al holds true, then: 

^ f(3K 5 K ) ^(0,1) 



V do ) K^co 

in distribution. 



Remark 4: Condition (I2T1) is the counterpart of Assumption A3 in the case of a separable variance 
profile and suffices to establish < liminf ^ (1 — p 2 jj) < limsup^(l — /? 2 77) < 1 (see for instance 
[20]), hence the fact that < liminf ^ Q 2 ^ < limsup^ £t 2 K < 00. The remainder of the proof of Corollary 
Q] is postponed to Appendix iBl 

Remark 5: As a direct application of Corollary Q] (to be used in Section ITVl below), let us provide the 
expressions of 7 and 7 for the MIMO Model © or MC-CDMA downlink Model ®. From (IT5b— (TTTb . 
we get 



W-l / 7 \ 2 N-l / , 

a„ \ 1 x -« / a, 



7 



7 



n 



-T. 



PkS . 

K 



Pk 



K \P + PPk$ 



where we recall that d n = X n for Model ©, d n = ^|/i(exp(2z7r(n - l)/N)\ 2 for Model ®, and 5 is 
the solution of (fl8i 

IV. Simulations 

A. The general (non necessarily separable) case 

In this section, the accuracy of the Gaussian approximation is verified by simulation. In order to 
validate the results of Theorems [2] and [3] for practical values of K, we consider the example of a MC- 
CDMA transmission in the uplink direction. We recall that K is the number of interfering users in this 
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context. In the simulation, the discrete time channel impulse response of user k is represented by the 
vector with L = 5 coefficients g^ = [gk,o, ■ ■ ■ ,9k,L-i] T - m the simulations, these vectors are generated 
pseudo-randomly according to the complex multivariate Gaussian law CAA(0, Setting the number 

of frequency bins to N, the channel matrix for user k in the frequency domain (see Eq. ([7])) is 
H fc = diag(/i fc (exp(2i7r(n - l)/N); 1 < n < N) where h k {z) = Ya=o 9k,lZ~K the norm ||g fc || is 
the Euclidean norm of g& and is the power received from user k. Concerning the distribution of the 
user powers we assume that these are arranged into five power classes with powers P, 2P, 4P, 8P and 
16P with relative frequencies given by Table IIV-AI The user of interest (User 0) is assumed to belong 

TABLE I 

Power classes and relative frequencies 



Class 


1 


2 


3 


4 


5 


Power 


P 


2P 


4P 


8P 


16P 


Relative frequency 


1/8 


1/4 


1/4 


1/8 


1/4 



to Class 1. Finally, we assume that the number K of interfering users is set to K = N/2. 

In Figured] the Signal over Noise Ratio (SNR) P/p for the user of interest is fixed to 10 dB. The 
evolution of KK((3k — Pk) 2 l®\ f° r tn i s user (where E(/?^ — fix) 2 is measured numerically) is shown 
with respect to K. We note that this quantity is close to one for values of K as small as K = 8. 
In Figure El K is set to K = 64, and the SINR normalized MSE KE(@ K - /3 K ) 2 /Q 2 K is plotted with 
respect to the input SNR Pj p. This figure also confirms the fact that the MSE asymptotic approximation 
is highly accurate. 

Figure [3] shows the histogram of \^K(Pk — (3k)/®k for N = 16 and N = 64. This figure gives an 
idea of the similarity between the distribution of VK{/3k — I3k)/&k and Af(0, 1). 
More precisely, Figure |4] quantifies this similarity through a Quantile-Quantile plot. 

B. The separable case 

In order to test the results of Proposition [2] and Corollary [T] we consider the following multiple antenna 
(MIMO) model with exponentially decaying correlation at reception: 

5] = _ ij/V^yp 1 / 2 
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I 

5] 0-4 




Fig. 1. SINR normalized MSE vs K 




Fig. 2. SINR normalized MSE vs SNR 
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-4 -2 2 4 -4 -2 2 4 



Fig. 3. Histogram of \fK{f3 K - /3k) for N = 16 and N = 64. 



QQplot for N=1 6 QQplot for N=64 




Empirical Quantiles Empirical Quantiles 



Fig. 4. Q-Q plot for VK((3k - Pk ), N = 16 and N = 64; dash doted line is the 45 degree line. 
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Pu 



where \l/ = [a m_n ]^ n L w i tri < a < 1 is the covariance matrix that accounts for the correlations at 
the receiver side, P = diag (po,--- ,Pk) is the matrix of the powers given to the different sources and 
W is a N x (K + 1) matrix with Gaussian standard iid elements. Let P u denote the vector containing 
the powers of the interfering sources. We set P u (up to a permutation of its elements) to: 

[4P 5P] if K = 2 
[P P 2P 4P] if K = 4 

[P P 2P 2P 2P 4P 4P 4P 8P 16P 16P 16P ] if K = 12 . 
For K = 2 P with 3 < p < 7, we assume that the powers of the interfering sources are arranged into 
5 classes as in Table HV-Al We set the SNR P/p to 10 dB and a to 0.1. We investigate in this section 
the accuracy of the Gaussian approximation in terms of the outage probability. In FigJ5J we compare 
the empirical 1% outage SINR with the one predicted by the Central Limit Theorem. We note that the 
Gaussian approximation tends to under estimate the 1% outage SINR. We also note that it has a good 
accuracy for small values of a and for enough large values of N (N > 64). 

Observe that all these simulations confirm a fact announced in Remark [2] above: compared with 
functionals of the channel singular values such as Shannon's mutual information, larger signal dimensions 
are needed to attain the asymptotic regime for quadratic forms such as the SINR (see for instance outage 
probability approximations for mutual information in [22] and in [23]). This observation holds for first 
order as well as for second order results. 

V. Proof of Theorem [3] 
This section is devoted to the proof of Theorem [3] We begin with mathematical preliminaries. 

A. Preliminaries 

The following lemma gathers useful matrix results, whose proofs can be found in [28]: 
Lemma 2: Assume X = [xjj]f^ =1 and Y are complex N x N matrices. Then 

1) For every i,j < N, \xij\ < ||X||. In particular, ||diag(X)|| < ||X||. 

2) ||XY|| < ||X|| ||Y||. 

3) For p > 0, the resolvent (XX* + pi)" 1 satisfies ||(XX* + < 

4) If Y is Hermitian nonnegative, then |tr(XY)| < ||X||tr(Y). 
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N=16 N=32 




Fig. 5. Theoretical and empirical 1% outage SINR 



Let X = UAV* be a spectral decomposition of X where A = diag(Ai,... ,A n ) is the matrix of 
singular values of X. For a real p > 1, the Schatten ^ p -norm of X is defined as ||X|| P = Af ) 1/p . The 
following bound over the Schatten £ p -norm of a triangular matrix will be of help (for a proof, see [25], 
[29, page 278]): 

Lemma 3: Let X = [xij]fj =l be a N x N complex matrix and let X = [xjjli>j]^ =1 be the strictly 
lower triangular matrix extracted from X. Then for every p > 1, there exists a constant C p depending 
on p only such that 

X|| p < Cp[|X[|p . 
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The following lemma lists some properties of the resolvent Q and the deterministic approximation 
matrix T. Its proof is postponed to Appendix |A) 
Lemma 4: The following facts hold true: 

1) Assume A2. Consider matrices Tk(— p) = diag(fi(— p), . . . ,tj\r(— p)) defined by Theorem [THI]). 
Then for every 1 < n < N, 

, 1 2 < U-P) < - • (23) 
P + ^niax P 

2) Assume in addition Al and A3. Let Qk(— p) = (YY* + pi) -1 and let matrices be as in the 
statement of Theorem [I]-©- Then 

supE|tr S K (Q K -T k )\ 2 < oo . (24) 

K 

B. Proof of Theorem |3]-([7]) 

We introduce the following notations. Assume that X is a real matrix, by X ^ we mean Xij > 
for every element X^. For a vector x, x is defined similarly. In the remainder of the paper, 
C = C(p, <r max , liminf ^, sup ^) < oo denotes a positive constant whose value may change from line 
to line. 

The following lemma, which directly follows from [21, Lemma 5.2 and Proposition 5.5], states some 
important properties of the matrices Ax defined in the statement of Theorem [3] 

Lemma 5: Assume A2 and A3. Consider matrices Ax defined by ( fT9l ). Then the following facts hold 
true: 

1) Matrix Ik — is invertible, and (Ik — A^) _1 )p= 0. 

2) Element (k, k) of the inverse satisfies [(Ik — A^)" 1 ], , > 1 for every 1 < k < K. 

3) The maximum row sum norm of the inverse satisfies limsupx \\\{Ik — ^■k)~ 1 \\\ 00 < °o. 

Due to Lemma [5HT]), Q 2 K is well defined. Let us prove that limsupx Q 2 K < oo. The first term of the 
right-hand side of (1201 ) satisfies 

-ig T (Ix - Ax^A-ig < ||g|UH(Ix - A^-'A^glU 

< iigiioo in (i^ - A^rlL iia-^Hoo < i| g ||^ ||(ix - AA-riL ( 25 ) 



October 10, 2008 



DRAFT 



A CENTRAL LIMIT THEOREM FOR THE SINR AT THE LMMSE ESTIMATOR OUTPUT 



20 



due to I A 1 ||| oo < 1. Recall that ||T|| < p 1 by Lemma H]-©. Therefore, any element of g satisfies 

1 9 JV 9 JV cr 4 

-trD D fc T 2 < - Do D fc T 2 < -^f- (26) 
K K K p z 

by A2, hence sup K ||g|| < C. From Lemma [S]-© and d25l ), we then obtain 

limsupig T (lK- A K )- 1 A- 1 g<C. (27) 

if ^ 

We can prove similarly that the second term in the right-hand side of (1201) satisfies sup^((E|iyio| 4 — 
l)^trDgT(-p) 2 ) < C. Hence limsup^e^ < oo. 
Let us prove that lim inf % Q 2 K > 0. We have 

^{Ik-Ak)- 1 *- 1 ^ > ^g T diag ((I K - Ak)- 1 ) A-'g 

(6) . < * 

> 



7 " ^2 17 £ f ^trD D fc T 2 

^trDo (t^) T2 ) 



(c) 1 

> 



M 1 ' < 

> 



itr D ^D fc 

V fc = l / 



where (a) follows from the fact that (Ijf-Ajf) 1 ^= (Lemma IS]-©, and the straightforward inequalities 
A -1 ^ and g ^ 0), (6) follows from Lemma IS]-© and ||A|| < (1 + f ^) 2 , (c) follows from the 
elementary inequality n _1 ^x 2 > (n _1 ^Xj) 2 , and (d) is due to Lemma I!]-©. Similar derivations 
yield: 

(Wo| 4 -l)^trD 2 T > f 1 W 2°t ~] f^trDoV > C(E|iy 10 | 4 - 1) 
by A3. Therefore, if A4 holds true, then liminfx Q 2 K > and Theorem [I]-© is proved. 

C. Proof of TheoremQffl 

Recall that the SINR (3k is given by Equation (fT4)) . The random variable ^-(Pk — Pk) can therefore 
be decomposed as 



K,„ — N I / — i/2 nn i/2 . I 



,,-{Pk-Pk) = w^^QD^ z w -tr(D Q) +^=— (tr(D (Q-T))) 

= U 1>K + U 2 ,k ■ (28) 
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Thanks to Lemma [J]-© and to the fact that liminf^: @ 2 K > 0, we have EC/jj- 2 < CK 1 which implies 
that Uk,2 — ► in probability as K —> oo. Hence, in order to conclude that 

\[K - 

— — (Pk - Pk) > AA(0, 1) in distribution , 

k$k K^oo 

it is sufficient by Slutsky's theorem to prove that U± : k —>■ A/"(0, 1) in distribution. The remainder of the 
section is devoted to this point. 

Remark 6: Decomposition (128T ) and the convergence to zero (in probability) of U^k yield the following 
interpretation: The fluctuations of ^/K([3k — @k) 316 mainly due to the fluctuations of vector w . Indeed 
the contribution of the fluctuationsy of -j^trDoQ, due to the random nature of Y, is negligible. 

Denote by E n the conditional expectation E n [ • ] = E[ • || W n ^, W n+ i t o, . . . , Wn,o, Y]. Put Etv+iI ■ ] = 

1 /2 1 /2 

E[ • || Y] and note that Etv+i(wqD QD Wq) = trDoQ. With these notations at hand, we have: 

1 N *T» 1 / 2 ^-Tk 1 / 2 1 N 

TT - 1 ST( W w ^ W D Q P w A 1 



n=l v n=l 

Consider the increasing sequence of a— fields 

Fn,k = <r(W Nfi , Y) , • • • , F 1>K = a(W lj0 , • • • , W N>0 , Y) . 

Then the random variable Z U; k is integrable and measurable with respect to T n ,K\ moreover it readily 
satisfies M n +iZ n K = 0. In particular, the sequence (Zj^^k, ■ ■ ■ , Z\ : k) lS a martingale difference sequence 
with respect to (jF/v,^, • • • ,T\.k)- The following CLT for martingales is the key tool to study the 
asymptotic behavior of Ui : k- 

Theorem 4: Let Xn k,Xn—i,Ki ■ ■ ■ )X\,K be a martingale difference sequence with respect to the 
increasing filtration Gn,k, ■ ■ ■ ,Gi,K- Assume that there exists a sequence of real positive numbers s 2 K 
such that 

1 N 
K n=l 

in probability. Assume further that the Lyapunov condition holds: 

1 - 

°K n=l 

'in fact, one may prove that the fluctuation of -j^trDo(Q — T) are of order K, i.e. trDo(Q — T) asymptotically behaves as 
a Gaussian random variable. Such a speed of fluctuations already appears in [21], when studying the fluctuations of the mutual 
information. 
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Then 2~2n=i X n ,i< converges in distribution to Af(0, 1) as K — > oo. 

Remark 7: This theorem is proved in [24], gathering Theorem 35.12 (which is expressed under the 
weaker Lindeberg condition) together with the arguments of Section 27 (where it is proved that Lyapunov's 
condition implies Lindeberg's condition). 



In order to prove that 

1 N 

U 1K = — — V Z nK > Af(0, 1) in distribution , (30) 

* — ' ' K^oo 
n=l 

we shall apply Theorem [4] to the sum ^ X^=i -^n,^ and the filtration {T^k)- The proof is carried out 
into four steps: 

Step 1: We first establish Lyapunov's condition. Due to the fact that liminfif @ 2 K > 0, we only 
need to show that 

N 

3a>0, y^E\Z n:K \ 2+a >0. (31) 



n=l 



Step 2: We prove that Vk = 2~2 n =i ^n+i-^n K satisfies 
Vk ~ [ ( E l m °[ 4 ~ 2 ) tr ( D 2(diag(Q)) 2 ) + ltr(D QD Q) ] > in probability . (32) 



Step 3: We first show that 

^,trDg(diag(Q)) 2 - ^trD^T 2 ► in probability. (33) 

In order to study the asymptotic behavior of -^tr(DoQD Q), we introduce the random variables Ug = 
-^tr(DoQD^Q) for < I < K (the one of interest being Uq). We then prove that the Ue's satisfy the 
following system of equations: 

K 1 

U £ = J2 c ^Uk + -trD D £ T 2 + e £ , < £ < K, (34) 

k=l 



where 

ctk = i? A - - o , 0<£<K,l<k<K (35) 



1 ^trD £ D fc T(-p) s 
^(l + ^trD fe T(-p)) 2 ' 



and the perturbations ei satisfy E|e^| < CK 2 where we recall that C is independent of I. 
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Step 4: We prove that Uq = -^trDoQDoQ satisfies 

U Q = -^trDgT 2 + lg T (I - A)" 1 A^g + e (36) 

with E|e| < CK~2. This equation combined with ([32]) and @3]> yields J] n E n+ iZ^ - Q 2 K -> in 
probability. As liminfxO|- > 0, this implies ^- ^ n E n+ iZ 2 K — ► 1 in probability, which proves (l30l) 
and thus ends the proof of Theorem [3] 



Write B = [&tf]£ =1 = QD^ and recall from © that = - E n+ i)wj5Bw . We 

have 

n-1 AT 

E n w„Bw = ^2 b u + ^ WloWe^be^ . 



ti,l 2 =n 



Hence 



J n,K 



/ TV N \ 

-7= (l^nol 2 " 1) + W^O E ^° 6 ^ + ^ E 

^ V £=n+l £=n+l / 



(37) 



Step 1: Validation of the Lyapunov condition: The following inequality will be of help to check 
Lyapunov's condition. 

Lemma 6 (Burkholder's inequality): Let be a complex martingale difference sequence with respect 
to the increasing sequence of cr-fields T^. Then for p > 2, there exists a constant C p for which 



E 



Recall Assumption Al. Eq. (I37T ) yields: 



4 1 |W„p| 2 + 1 
I < T7o I 2 ^ 2 



*2 I ^ 



A 



£=n+l 



< 



if 2 



\W n0 \ 2 + l\ +24 



P^max 



A 



=n+l 



where we use the fact that |6 nn | < (pa^ax) 1 ( £ f- Lemma EHD) an d the convexity of x 
Assumption Al, we have: 

E {\W n0 \ 2 + I) 4 < 2 3 (E|^ n0 | 8 + 1) < oo . 



(38) 
x 4 . Due to 

(39) 
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Considering the second term at the right-hand side of (1381) . we write 



E 



A 



w n0 Y 



=n+l 



ElWnofE 



N 



Y w mbrd 



£=n+l 



(a) 

< 



A 



A 



c e [ mw m \ 2 )\bni\ 2 + Y (nw eQ \ 4 m\b ni \ i 



\£=n+l 



-n+1 



(6) ( N \ N 

< C E 2 M* + E E ^ 
\ \£=n+l / £=n+l 



where (a) follows from Lemma [6] (Burkholder's inequality), the filtration being F^k, ■ ■ ■ , fn+i^K and 
(b) follows from the bound \b n e\ 4 < |6 n £| 2 max \b n g\ 2 < \b n g\ 2 '(^niaxP 1 ) 2 Lemma 0-©). Now, 
notice that 



A' 



A 



Y M 2 <Y\ b 

l=n+l 1=1 



lit 



Do /2 QD QDo /2 



< llD^QDoQD^ 2 !! < ^Sf 

nn p 



This yields E| W n0 Y^t= n +l w eoKt\ 4 < C. Gathering this result with ([39]), getting back to (|38]>, taking 
the expectation and summing up finally yields: 



A 



K < 



C 



n=l 



K K^oo 

which establishes Lyapunov's condition (I3T1) with a = 2. 
Step 2: Proof of d32]).- Eq. ([37]) yields: 



A 



A 



E n+1 Zl K = 1 (E|Ty 10 | 4 - 1) b 2 nn + E n+1 w: Y w eoKe + W n0 Y w ^tn 

\ \ £=n+l l=n+l J 

N N \ 

+26 nn (ETy 1 * |^ 10 | 2 ) Y W eo bni + 2b nn {EW lo \W 10 \ 2 ) Y ^oM 

£=n+l l=n+l / 

Note that the second term of the right-hand side writes: 

/A A \ 2 A 

E n+i W* Y w eobnt + W n0 Y w m h zn = 2 £ W £l0 w; 20 6 n£l 6 €an . 

\ £=n+l £=n+l / £i,£ 2 =n+l 

Therefore, V K = YZ=i ^n+iZ 2 K writes: 



V K 



(E|^ 10 | 4 -l) 
K 



A 



N N 



Y h nn + ^Y Y W il0 Wl 20 b nil I 



n=l 



n=l e 1 ,l 2 =n+l 



/ N N N 

\ n=l £=n+l y 
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where 5ft denotes the real part of a complex number. We introduce the following notations: 

. N N 

n=l £=n+l 

1/2 1/2 

Note in particular that R is the strictly lower triangular matrix extracted from D QD . We can now 



rewrite Vk as: 

V K = ( E l^ zA tT (D 2 (diag(Q)) 2 ) + -^w^RR*w + 25R (T K EW? \W W \ 2 ) . (40) 



We now prove that the third term of the right-hand side vanishes, and find an asymptotic equivalent for 
the second one. Using Lemma [2j we have: 

1 N N 1 

Eat+iIT^I 2 = ^ ^ bnnbmm^bnib^l^nl^m = — ^tr (diag(B)R*Rdiag(B)) 

n,m=l l=\ 



j^tr ^ /z diag(Q)D^ z R*RD^ z diag(Q)D^ 



< ||D || 2 ||Q|| 2 tr(R.*R.) < -^||D || 2 ||Q|| 2 tr(B 2 ) < ||D || 4 ||Q|| 2 tr(Q 2 ) 

< ^||D || 2 ||Q|| 4 < — — 0. 
In particular, E|r^| 2 — » and 

3t ((EWiol^iol 2 ) T K ) ► in probability . (41) 

Consider now the second term of the right-hand side of Eq. (l40b . We prove that: 

^w^RR*w - -jLr(RR*) ► in probability. (42) 



By Lemma [T] (Ineq. (fT2l). we have 



E w^RR*w - — tr(RR*)J < (E| W 10 | 4 )tr(RR*RR*) . 
Notice that tr(RR*RR*) = ||R||4 where ||R||4 is the Schatten ^4-norm of R. Using Lemma[3l we have: 



|R||| < C\\Bl /2 QBl /2 \\i < iVC7||Dy 2 QDy 2 || 4 < N— 4 



max 



Therefore, 



1 1 \ 2 N 

E ( -wjRRVo - — tr(RR*) < C 
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which implies (1421 . Now, due to the fact that B = B*, we have 

9 N N 

|trRR* = |E E M 2 

n=l £=n+l 
1 N 1 JV 

= ^ E i^i 2 -^Ei 6 ««i 2 

n,£=l n=l 

= -^trD QD Q--itrD2(diag(Q)) 2 (43) 

Gathering (|40h431) . we obtain ([32]). Step 2 is proved. 

Ste/? J: Proof of (l33l a«<i (l34b : We begin with some identities. Write Q(z) = [<Zij(#)]ij = i and 
Q(z) = [%j(^)]|j = i- Denote by the column number k of Y and by £ n the row number n of Y. 
Denote by Y fe the matrix that remains after deleting column k from Y and by Y n the matrix that remains 
after deleting row n from Y. Finally, write Qk(z) = (Y fc Y fc * — zl)~ l and Q n (z) = (Y*Y n — zl)^ 1 . 
The following formulas can be established easily (see for instance [28, §0.7.3. and §0.7.4]): 

qnn(-p) = — — ~— — — , qkk(-p) = . * - ( r — r, (44) 

H = 14k ~ 7- — ^ (43) 

i + y^Qfeyfe 

Lemma 7: The following hold true: 

1) (Rank one perturbation inequality) The resolvent Q&(— p) satisfies |trA(Q — Q&)| < ||A||/p for 
any N x N matrix A. 

2) Let Assumptions A1-A3 hold. Then, 

C 

max E(q nn (-p) - t n (-p)f < — . (46) 
l<n<N A 

The same conclusion holds true if q nn and t n are replaced with q^k and ik respectively. 



We are now in position to prove (133b - First, notice that: 

E knn~*nl = K \Qnn ~ t n \ (q nn + t n ) 



< ^{q nn -t n y^{q nn + t n ) 2 < -^{q nn -t n ) 2 . (47) 

P 
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Now, 

1 1 N 4 /V 

-E|trD 2 (diag(Q) 2 -T 2 )| < - £ < n E \q 2 nn - t\ \ < ^max^E \q^ n - i 2 | 

n=l 



2a* N 



pA y l<n<N K^oo 

where the last inequality follows from d47l ) together with Lemma IT]-©. Convergence 03l ) is established. 
We now establish the system of equations d34b . Our starting point is the identity 

Q = T + T(T~ X - Q _1 )Q = T + -^T diag(trDiT, . . . , trD iV T)Q - TYY*Q . 

Using this identity, we develop Ug = -^trDoQD^Q as 

U t = ^trDoQD £ T + -^trD QD £ Tdiag(trDiT,...,trD 7V T)Q-^trDoQD^TYY*Q 
K K K 

= Xi + X 2 - X 3 . (48) 

Lemma g]-© with S = D D^T yields: 

X 1 = -UrDoD^T 2 + ei (49) 
A 

where E|ei| < y/Ee[ < C/K. Consider now the term X 3 = ± J2k=i trD QD £ Ty fc y£Q. Using (01 
and (1431) . we have 



Hence 



Xs = ^Z)tey|QfeD QD^Ty fe 



fc=i 

K K 

= J(Y1 4yfeQfcDoQD^Ty fc + £ J^(9fc fc - 4)yfcQfeD QD £ Ty fc 

fe=l k=l 

= X' 3 + e 2 . (50) 



By Cauchy-Schwartz inequality, 

K 



E|e 2 | < -| V E (9fcfc " tfc) 2 A / E (yfcQfc D oQD£Ty fc ) 2 . 
fc=i 

We have E(y^Q fc D QD£Ty fc ) 2 < cr^ ax /r 6 E||y fe || 4 < C. Using in addition Lemma [TJ-©, we obtain 

E|e 2 | < 
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Consider X%. From (l44l ) and (|45T ). we have Q = — pqkkQkykYkQk- Hence, we can develop X' 3 as 

K o K 

r 



^* fc yfeQ fcD oQ fcD<?T y fc ~~ X/ fc * fcy fcQ feD °Q fcyfey fcQ fcD ^ TyA 



fc=i 



fc=i 



A 



X 4 + X5 . 



(51) 



Consider X4. Notice that y& and are independent. Therefore, by Lemma [T] we obtain 

y^Q fc D Q fc D^T yA; = -^trD fe Q fc D Q fc D £ T + e 3 = -^trD fe QD QD^T + e 3 + e 4 

where Ee| < CK~ l by Ineq. ([13]). Applying twice Lemma |7]-(Q} to e 4 = ^(trD fe Q fc D QfcD^T - 
trD fc QDoQD^T) yields |e 4 | < CK' 1 . Note in addition that E*fcDfc = diag(trf>iT, . . . , trfVT) 
Thus, we obtain 



P 

K 2 



tr ? k D kJ QDoQD^T + e 5 



X 2 + e 5 , 



(52) 



where £5 = e 3 + e 4 , which yields E|es| < CK 2. 

We now turn to X5. First introduce the following random variable: 

£6 = ifctoyfcQfcDoQfcyfcyfcQfcD^Tyfc _ t^kk ^-^trDfcQfcDoQ/t 

Then 



-trD fc Q fc D^T 



eel < ^yfcQfcDoQfeyfc 



y£Q fc D £ Ty fc - -trD fc Q fc D £ T 



+ - 

P 2 



yfcQfcDoQfcyfc - -^trD fe Q fc D Q fc 



-trD fe Q fc D^T 



and one can prove that E|eg| < CK I with help of Lemma [U together with Cauchy-Schwarz inequality. 
In addition, we can prove with the help of Lemma |7] that: 



tkqkk — trD fe Q fc D Qfe — trD fc Q fc D£T 



K 



1 



K 



P k I ^trD fc QDoQ ) ( ^trD fc QD £ T ) + e 7 



1 



K 
1 



1 



K 



K 
1 



4 -trD fc QDoQ — trD^T 2 + e 7 + e 8 



K 



where e 7 and es are random variables satisfying E|e 7 | < CK~? by Lemma |7J and maxjt^E|eg| < 
max^f y / E|e^p < CK~* by Lemma HJ-©. Using the fact that p 2 t\ = (1 + ^trD fc T)~ 2 , we end up 
with 

X5 = ~R^ Pk (^ trD fcQ D oQ) (-trD fc D £ T 2 j +e 9 = -^Q fc ^ + e 9 (53) 
k=i ^ ' ^ ' k=i 
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where cy. is given by (1351 ). and where E|eg| < CK 2. 

Plugging Eq. (I49l)-(l53l into (@8]), we end up with U t = Ylk=i c ekU k + ^trD D £ T 2 + e with E|e| < 
CK~z. Step 3 is established. 

Step 4 : Proof of (l36l) ; We rely on results of Section IV-BI in particular on Lemma [5] 

Define the following (K + 1) x 1 vectors: 



u = [U k ]JU, d 



-^trDoDfcT 2 



K 



k=0 



K 

k=0 > 



where the £/fc's and e^'s are defined in (l34b . Recall the definition of the c^'s for < £ < K and 
1 < k < K, define ceo = for < I < K and consider the (K + 1) x (K + 1) matrix C = [Qfc]J^ =0 . 
With these notations, System (l34l) writes 

(IiC+i - C) u = d + e . (54) 

Let a = -^trD 2 ,T 2 and [3 = (1 + ytrD T) 2 . We have in particular 

(recall that A, A and g are defined in the statement of Theorem [3]). 

Consider a square matrix X which first column is equal to [1, 0, . . . , 0] T , and partition X as X = 
Recall that the inverse of X exists if and only if Xjj exists, and in this case the first row 





a 







d = 




, c = 






g 








1 TC T 

1 x 01 



X n 

[X _1 ]o of X -1 is given by 

[X^] Q =[1 -x&XJi 1 ] 
(see for instance [28]). We now apply these results to the system (l54l . Due to (l54l . Uq can be expressed 



as 



?7o = [(I-C)- 1 ]o(d + e) 
By Lemma [J]-©, (!# — A T )~ 1 exists hence (I — C) -1 exists, 



lK+l 



and 



Uo = a + l^A" 1 (I - AT 1 g + eo + Ig^A- 1 (I - AT 1 e' 



with e' = [ei, . . . , €k] T - Gathering the estimates of Section IV-BI together with the fact that ||Ee||oo < 
CK~i, we get (l36l ). Step 4 is established, so is Theorem [3] 
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Appendix 

A. Proof of Lemma [?] 



Let us establish (1231 . The lower bound immediately follows from the representation 

1 ( a ) 1 



*n — , _ ^ „2 — o 



P + x 2^=i i+i E » ; ^ 
where (a) follows from A2 and tn{—p) > 0. The upper bound requires an extra argument: As proved in 
[26, Theorem 2.4], the i n 's are Stieltjes transforms of probability measures supported by M + , i.e. there 
exists a probability measure p n over M + such that t n (z) = j i^^Q . Thus 

7o t + p p 

and (l23l) is proved. 

We now briefly justify dHJ). We have E|trS(Q - T)| 2 = E |trS(Q - EQ)| 2 + |trS(EQ - T)| 2 . In 
[21, Lemma 6.3] it is stated that sup^ E |trS(Q — EQ)| 2 < oo. Furthermore, in the proof of [21, 
Theorem 3.3] it is shown that sup^ if||EQ - T|| < oo, hence |trS(EQ - T)| < iT||S(EQ - T)|| < 
K\\EQ - T || || S || < oo by Lemma E]-©. The result follows. 

B. Proof of Corollary \J} 

Recall that in the separable case, = c^D and D n = d n T). Let d be the K x 1 vector d = [dk]k=v 
In the separable case, Eq. (f20b is written 

% = T^g T ( J " A)" 1 A- X g + 7 (E|^io| 4 - 1) , (55) 
where 7 is defined in statement of the corollary. Here, vector g and matrix A are given by 



g = jdod and A 



^(l + itrD £ T) 2 
By the matrix inversion lemma [28], we have 



K 

t,m=l 



4-A _1 dd T . 



1 g^I-A^A-ig = ^fA-^r'a 



Kd 2 ° " ° K \ K 




A ^dcFA^ 1 d 



Noticing that 



1 1 K d 2 n 2 - 

±d T a-^ = 1 y ^ — , = p - y d\ti = p 2 7 
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we obtain 



Kdl 



g r (I 



P 2 11 
- P 2 11 



Plugging this equation into (1551 ). we obtain ((22 



C. Proof of Lemma\7\ 

The proof of Part Q] can be found in [21, Proof of Lemma 6.3] (see also [14, Lemma 2.6]). Let us 
prove Part |2] We have from Equations (|TT|) and (|44)) 



rn(-p) -tn(-p)\ 



p(l + ^trD n T)(l + £ n Q n e 



< 



Hence, 

E(f/nn tfi) 

by Lemma Q] and Lemma ID-©, which proves (l46l ). 



< ^ n Q„C - ^trD n Q^ +-| I E(trD n (Q-T)f < 



C 
A' 
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